Max-Prob: An Unbiased Rational Decision Making Procedure for Multiple-Adversary Environments
نویسندگان
چکیده
In binary-utility games, an agent can have only two possible utility values for final states, 1 (win) and 0 (lose). An adversarial binary-utility game is one where for each final state there must be at least one winning and one losing agent. We define an unbiased rational agent as one that seeks to maximize its utility value, but is equally likely to choose between states with the same utility value. This induces a probability distribution over the outcomes of the game, fromwhich an agent can infer its probability to win. A single adversary binary game is one where there are only two possible outcomes, so that the winning probabilities remain binary values. In this case, the rational action for an agent is to play minimax. In this work we focus on the more complex, multiple-adversary environment. We propose a new algorithmic frameworkwhere agents try to maximize their winning probabilities. We begin by theoretically analyzing why an unbiased rational agent should take our approach in an unbounded environment and not that of the existing Paranoid or MaxN algorithms. We then expand our framework to a resource-bounded environment, where winning probabilities are estimated, and show empirical results supporting our claims.
منابع مشابه
Decision-Making Styles and Attitude Towards Substances: Predictors of Potential Addiction in Adolescents
Objective: In all societies, adolescents are the most vulnerable age group to addiction. Decision-making styles and attitude toward substances can play an important role in the tendency of adolescents to addiction. The aim of the current study was to investigate the role of decision-making styles and attitude toward substances in predicting the potential addiction among adolescents. Methods: I...
متن کاملHull Performance Assessment and Comparison of Ship-Shaped and Cylindrical FPSOs With Regards To: Stability, Sea-Keeping, Mooring and Riser Loads In Shallow Water
Floating, Production, Storage and Offloading “FPSO” have become a popular choice since 1980s for marginal and fast-track developments where subsea pipeline is not an economic or feasible solution for export. Field development usually starts with a concept selection procedure which is constituted from a sequence of multi-disciplinary decision making tasks. As limited data is availabl...
متن کاملActing Irrationally to Improve Performance in Stochastic Worlds
Despite many theories and algorithms for decision–making, after estimating the utility function the choice is usually made by maximising its expected value (the max EU principle). This traditional and ‘rational’ conclusion of the decision–making process is compared in this paper with several ‘irrational’ techniques that make choice in Monte–Carlo fashion. The comparison is made by evaluating th...
متن کاملEffects of Interruptible Load on Decision Making of a Distribution Company in Competitive Environments
The main goal of this paper is to present a new day-ahead energy acquisition model for a distribution company (Disco) in a competitive electricity market environment with Interruptible Load (IL). The work formulates the Disco energy acquisition model as a bi-level optimization problem with some of real issues, and then studies and designs a Genetic Algorithm (GA) of this optimization problem to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011